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In the Claims: 

The following listing of claims replaces all prior versions and listings of claims in the case. 

1 . (Currently Amended) A computerized method for optimizing a nucleotide sequence for the expression of a 

protein on the basis of the amino acid sequence of the protein, which comprises the following steps : 

generating, using a suitably programmed computer, a first test sequence of n codons which 

correspond to n consecutive amino acids in the protein sequence, where n is a natural number 
and is less than or equal to N, the number of amino acids in the protein sequence, 

specifying, using [[a]] die suitably programmed computer, m optimization positions in the test 
sequence which correspond to the position of m codons at which the occupation by a codon, 
relative to the test sequence, is to be optimized, where m < n and m < N, 

generating, using [[a]] the suitably programmed computer, one or more further Icsl sequences from 
the first test sequence by replacing at one or more of the m optimization positions a codon of 
the first test sequence by another LX)d()n which expresses the same amino acid, 

assessing, using [[a]] the suitably programmed computer, each of the test sequences with a quality 
function and ascertaining the test sequence which is optimal in relation to the quaUty 
function, 

specifying, using [[a]] die suitably programmed computer, p codons of the optimal test sequence 
which are located at one of the m optimization positions, as result codons which form the 
codons of the optimized nucleotide sequence at the positions which corresponds to the 
position of said p codons in the test sequence, where p is a natural number and p < m, 

iterating the preceding steps, where in each iteration step the test sequence comprises the 

appropriate result codon at the positions which correspond to positions of specified result 
codons in the optimized nucleotide sequence, and the optimization positions are different 
from positions of result codons. 

2. (Previously Presented) The method as claimed in claim 1, wherein in one or more iteration steps the m 

optimization positions of the test sequences directly follow one or more result codons which have been 
specified as part of the optimized nucleotide sequence. 



3 



David Raab et al. 
Application No. 10/539,208 

3. (Previously Presented) The method as claimed in claim 1, wherein in one or more iteration steps the p 

codons which are specified as result codons of the optimized nucleotide sequence are p consecutive 

codons. 

4. (Previously Presented) The method as claimed in claim 1, wherein in one iteration step test sequences with 

all possible codon occupations for the m optimization positions are generated from the first test sequence, 
and the optimal test sequence is ascertained from these test sequences. 

5. (Previously Presented) The method as claimed in claim 1, wherein: 

each test sequence is assessed with a quality function, 

an extreme value is ascertained within the values of the quality function for all partial sequences 

generated in an iteration step, 
p codons of the test sequence are specified which corresponds to the extreme value of the weight 

function as result codons at the appropriate positions, where p is a natural number and p<m. 

6. (Previously Presented) The method as claimed in claim 5, wherein the quality function takes account of one 

or more of the following criteria: 

codon usage I'or a prcdcl'incd organism. CiC content, repetitive sequences, secondary structures, inverse 
complementary sequence repeats and sequence motifs. 

7. (Previously Presented) The method as claimed in claim 6, wherein the quality function is a function of 

various single terms which in each case assess one criterion Froiii the following list of criteria: 

codon usage for a predefined organism, GC content, sequence motifs, repetitive sequences, secondary 

structures, inverse complementary sequence repeats. 

8. (Previously Presented) The method as claimed in claim 1, wherein the quality function takes account of one 

or more of the following criteria: 

exclusion of inverse complementary sequence identities of more than 20 nucleotides to the transcriptome 

of a predefined organism, 

exclusion of homology regions of more than 100 base pairs to a predefined DNA sequence, 

exclusion of homology regions with more than 90% similarity of the nucleotide sequence to a predefined 

DNA sequence. 

9. (Previously Presented) The method as claimed in claim 1, further comprising S5aithesizing the optimized 

nucleotide sequence. 
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10. (Previously Presented) The method as claimed in claim 9, wherein the step of synthesizing the optimized 

nucleotide sequence takes place in a device for automatic synthesis of nucleotide sequences which is 
controlled by the computer which optimizes the nucleotide sequence. 

1 1 . (Currently Amended) A device for optimizing a nucleotide sequence for the expression of a protein on the 

basis of the amino acid sequence of the protein, which has a computer unit comprising algorithms, 
including: 

an algorithm for generation of a first Icsl sequence of n codons which correspond to n consecutive 

amino acids in the protein sequence, where n is a natural number and is less than or equal to 

N, the number of amino acids in the protein sequence, 
an algorithm for specification of m optimization positions in the test sequence which correspond to 

the position of m codons at which the occupation by a codon, relative to the test sequence, is 

to be optimized, where in m < n and m < M N, 
an algorithm for generation of one or more further test sequences from the first test sequence by 

replacing at one or more of the m optimization positions a codon of the first test sequence by 

another codon which expresses the same amino acid, 
an algorithm for assessment of each of the test sequences with a quality function and for 

ascertaining the test sequence which is optimal in relation to the quality iiinction, 
an algorithm for specification of p codons of the optimal test sequence which are located at one of 

the ift m optimization positions, as result codons which form the codons of the optimized 

nucleotide sequence at the positions which coixespond to the positions of said p codons in the 

test sequence, where p is a natural number and p < m, 
an algorithm for iteration of the steps of generation of a plurality of test functions, of assessment of 

the test sequences and of specification of result codons, where in each iteration step the test 

sequence comprises the appropriate result codon at the positions which correspond to 

positions of specified result codons in the optimized nucleotide sequence, and the 

optimization positions are different from positions of result codons. 

12. (Previously Presented) The device as claimed in claim 11, further comprising an algorithm for carr5dng out 

the steps of a method as claimed in claim 1 . 

13. (Previously Presented) The device as claimed in claim 11, further comprising a device for automatic 

synthesis of nucleotide sequences which is controlled by the computer in such a way that it S5aithesizes the 
optimized nucleotide sequence. 
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14. (Previously Presented) A computer program product comprising instructions encoded on a non-transitory 

computer readable medium, wherein the instructions are executable by a computer to cause the computer 
to carry out a method as claimed in claim 1 . 

15. (Previously Presented) The computer program product as claimed in claim 14, where the instructions can, 

when executed by a computer, cause a device for the automatic synthesis of nucleotide sequences to 
prepare the optimized nucleotide sequence. 

16. (Cancelled) 

17. (Withdrawn) A nucleic acid which includes a nucleotide sequence coding for a protein and which is 

obtainable by a method as claimed in claim 9. 

18. (Withdrawn) The nucleic acid as claimed in claim 17, characterized in that the latter includes a nucleotide 

sequence which codes in a predefined organism for a protein, where said nucleotide sequence is not 
present in the naturally occurring genome of the organism. 

19. (Withdrawn) The nucleic acid as claimed in claim 18, characterized in that the organism is selected from the 

following group: 

viruses, especially vaccinia viruses, 

prokaryotes, especially Escherichia, Caulobacter cresentus. Bacillus subtiUs, Mycobacterium spec, 
yeasts, especially Saceharomyces cerevisiae, Schizosaccharomyces pombe, Pichia pastoris, Pichia 
angusta, 

insects, especially Sprodoptera frugiperda, Drosophila spec, 

mammals, especially Homo sapiens, Macaca mulata, Mus musculus, Dos taurus, Capra hircus, 

Ovis aries, Oryctolagus cuniculus, Rattus norvegicus, Chinese hamster ovary, 
monocotyledonous plants, especially Oryza sativa, Zea mays, Triticum aestivum, 
dicotyledonous plants, especially Glycin max. Gossypium hirsutum, Nicotiana tabacum, 
Arabidopsis thaliana, Solanum tuberosum. 

20. (Withdrawn) The nucleic acid as claimed in claim 4-17, characterized in that the protein encoded by the 

nucleotide sequence is one of the following proteins and/or falls into one of the following protein classes: 

enzymes, especially polymerases, endonucleases, 

Ugases, ligases, Ugases, proteases, kinases, phosphatases, topoisomerases, cytokines, chemokines, 
transcription factors, oncogenes, 
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proteins from thermophilic organisms, from cryophilic organisms, from halophihc organisms, from 

acidophilic organisms, from basophilic organisms, proteins with repetitive sequence elements, 

especially structural proteins, human antigens, especially tumor antigens, tumor markers, 

autoimmune antigens, diagnostic markers, 
viral antigens, especiaUy from HAV, HBV, HCV, HIV, SIV, W, HPV, rinoviruses, influenza 

viruses, herpesviruses, pellenr-uses, polyoma viruses, hendra virus, dengue virus, AAV, 

adenoviruses, HTLV, RSV, 
antigens of disease-causing parasites, e.g. protozoa, especially those causing malaria, leishmania, 

trypanosoma, toxoplasmas, amoeba, 
antigens of disease-causing bacteria or bacterial pathogens, especially of the genera Chlamydia, 

staphylococci, Klebsiella, Streptococcus, Salmonella, Listeria, BorreUa, Escherichia coli, 
antigens of organisms of safety level L4, especially Bacillus anthracis, Ebola virus, Marburg virus, 

poxviruses. 

21. (Withdravm) The nucleic acid as claimed in claim 18, characterized in that the quaUty function takes 

account at least of one the following criteria: 
GC content, 

codon usage of the predefined organism, 

exclusion of inverse complcnicnlarv sequence identities of more than 20 nucleotides to the 

transcriptome of a predetermined organism, 
complete or substantial exclusion of homology regions of more than 100 base pairs to a predefined 

DNA sequence, 

complete or substantial exclusion of homology regions with a similarity of more than 90% to a 

predefined DNA sequence. 

22. (Withdrawn) A vector comprising a nucleic acid as claimed in claim 17. 

23. (Withdravm) A cell comprising a vector as claimed in claim 22. 

24. (Withdrawn) An organism comprising at least one cell as claimed in claim 23. 

25. (Withdrawn) A nucleic acid, in particular as claimed in claim 9, comprising a nucleotide sequence which is 

selected from the group comprising: SEQ ID NO: 2, 4. 6. 8. 

26. (Withdrawn) A vector comprising a nucleic acid as claimed in claim 25. 
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27. (Withdrawn) A cell comprising a vector as claimed in claim 26. 

28. (Withdrawn) An organism comprising at least one cell as claimed in claim 27. 

29. (Withdrawn) A cell comprising a nucleic acid as claim in claim 17. 

30. (Withdrawn) A cell comprising a nucleic acid as claimed in claim 25. 

3 1 . (Currently Amended) A computerized method for optimizing a nucleotide sequence for the expression of a 

protein on the basis of the amino acid sequence of the protein, which comprises the following: 

generating, using a suitably programmed computer, a first test sequence ol' n eodons which 

correspond to n consecutive amino acids in the protein sequence, where n is a natural number 
and is less than or equal to N, the number of amino acids in the protein sequence, 

specifying, using [[a]] die suitably prograimned computer, m optimization positionsin the test 
sequence which correspond to the position of m codons at which the occupation by a codon, 
relative to the test sequence, is to be optimized, where m < n and m < N, 

generating, using [[a]] die suitably programmed computer, one or more further test sequences from 
the first test sequence by replacing at one or more of the m optimization positions a codon of 
the first test sequence by another codon which expresses the same amino acid, 

assessing, using [[a]] die suitably programmed computer, each of the test sequences with a quality 
function and ascertaining the test sequence which is optimal in relation to the quality 
function, 

specifying, using [[a]] the suitably programmed computer, p codons of the optimal test sequence 
which are located at one of the m optimization positions, as result codons which form the 
codons of the optimized nucleotide sequence at the positions which corresponds to the 
position of said p codons in the test sequence, where p is a natural number and p < m, 

iterating the preceding steps, where in each iteration step the test sequence comprises the 

appropriate result codon at the positions which correspond to positions of specified result 
codons in the optimized nucleotide sequence, and the optimization positions are different 
from positions of result codons, and wherein after said result codons are specified in any 
iteration step, the result codons are not changed in subsequent iterations. 



32. (Previously Presented) The method according to claim 31, wherein at least some of the m optimization 
positions are connected and form a variation window on which codon occupation is to be varied. 
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33. (Previously Presented) The method according to claim 32, wherein the variation window in one iteration 
step overlaps with the variation window of a preceding iteration step. 

34. (Previously Presented) The method according to claim 32, wherein the variation window has a size of m=3 
to m=20. 

35. (Previously Presented) The method according to claim 34, wherein the variation window has a size of m=5 

to m=10. 

36. (New) A computerized method for optimizing a nucleotide sequence for the expression of a protein on the 

basis of the amino acid sequence of the protein, which comprises the following steps performed on one or 
more suitably programmed computers: 

generating a first test sequence of n codons which correspond to n consecutive amino acids in the 

protein sequence, where n is a natural number and is less than N, the number of amino acids 

in the protein sequence, 
specifying m optimization positions in the test sequence which correspond to the position of in 

codons at which the occupation by a codon. relative to the test sequence, is to be opliiiii/.cd, 

where ni < n and ni < N, 
generating one or more further test sequences from the first test sequence by replacing at one or 

more of the m optimization positions a codon of the first test sequence by another codon 

which expresses the same amino acid, 
assessing each of the test sequences with a quahty function and ascertaining the test sequence 

which is optimal in relation to the quality function, 
specifying p codons of the optimal test sequence which are located at one of the m optimization 

positions, as result codons which form the codons of the optimized nucleotide sequence at the 

positions which corresponds to the position of said p codons in the test sequence, where p is a 

natural number and p < m, 
iterating the preceding steps, where in each iteration step the test sequence comprises the 

appropriate result codon at the positions which correspond to positions of specified result 

codons in the optimized nucleotide sequence, and the optimization positions are different 

from positions of result codons. 
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37. (New) A computerized method for optimizing a nucleotide sequence for the expression of a protein on the 
basis of the amino acid sequence of the protein, which comprises the following performed on one or more 
suitably programmed computers: 

generating a first test sequence of n codons which correspond to n consecutive amino acids in the 

protein sequence, where n is a natural number and is less than N, the number of amino acids 

in the protein sequence, 
specifying m optimization positionsin the test sequence which correspond to the position of m 

codons at which the occupation by a codon, relative to the test sequence, is to be optimized, 

where m < n and m < N, 
generating one or more further test sequences from the first test sequence by replacing at one or 

more of the m optimization positions a codon of the first test sequence by another codon 

which expresses the same amino acid, 
assessing each of the test sequences with a quahty function and ascertaining the test sequence 

which is optimal in relation to the quality function, 
specifying p codons of the optimal test sequence which are located at one of the m optimization 

positions, as result codons which form the codons of the optimized nucleotide sequence at the 

positions which corresponds to the position of said p codons in the test sequence, where p is a 

natural number and p < m, 
iterating the preceding steps, where in each iteration step the test sequence comprises the 

appropriate result codon at the positions which correspond to positions of specified result 

codons in the optimized nucleotide sequence, and the optimization positions are different 

from positions of result codons, and wherein after said result codons are specified in any 

iteration step, the result codons are not changed in subsequent iterations. 



10 



